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In biology experiments, oligonucleotide microarrays are contacted with a solution of long nucleic 
acid (NA) targets. The hybridized probes thus carry long tails. When the surface density of the 
■ oligonucleotide probes is high enough, the progress of hybridization leads to the formation of a poly- 

electrolyte brush due to mutual crowding of the NA tails. The free energy penalty associated with the 
brush modifies both the hybridization isotherms and the rate equations: the attainable hybridization 
is lowered significantly as is the hybridization rate. While the equilibrium hybridization fraction, x eq , 
' is low, the hybridization follows a Langmuir type isotherm, x eg /(l — x eq ) = c t K where c t is the target 

concentration and K is the equilibrium constant smaller than its bulk value by a factor (n/N) 2 ^ 5 due 
. to wall effects where n and N denote the number of bases in the probe and the target. At higher x eq , 

when the brush is formed, the leading correction is x eq /(l — x eq ) = c t K exp[— const' (x 2 q 3 — x 2 J 3 )] 
where xb corresponds to the onset of the brush regime. The denaturation rate constant in the two 
regimes are identical. However, the hybridization rate constant in the brush regime is lower, the 
leading correction being exp[— const' (x 2 ^ 3 — x 2 J 3 )]. 
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INTRODUCTION 



The growing availability of genomic DNA sequences enables research on profiles of gene expression, single nucleotide 
polymorphism (SNP) and their role, molecular diagnostics for cancer etc. In turn, these activities require simultaneous 
' interrogation of a given sample for the presence of numerous different nucleic acid sequences. DNA microarrays, "DNA 
chips", emerged as an important method for such parallel analysis (Graves, 1999; Wang, 2000; Lockhart and Winzeler, 
2000; Heller, 2002). DNA chips function by parallel hybridization of labelled nucleic acid sequences in the solution, 
known as targets, to an array of nucleic acid probes bound to a surface. Numerous identical probes are localized at a 
small area known as "spot" or "probe cell" . The composition of the sample is deduced from the label intensities of the 
different spots after the hybridization. DNA chips are produced in one of two main formats. In cDNA microarrays, long 
cDNA targets are physisorbed onto the substrate while in oligonucleotide chips short oligonucleotides are chemically 
bound to the surface via their terminal groups. Our theoretical considerations address the hybridization behavior, 
kinetics and thermodynamics, of oligonucleotide microarrays when the targets are much longer than the probes as is 
typically the case in biology experiments (See for examples: Guo et al., 1994; Prix et al., 2002). In particular, we 
analyze the consequences of the interactions between the long hybridized targets at the surface (Fig.l). 

A growing theory effort aims to clarify the underlying physics of DNA chips with view of assisting in their design 
and in the analysis of the results. The Langmuir isotherm and the corresponding kinetic scheme provide a natural 
starting point for the modeling (Chan et al., 1995; Livshits and Mirzabekov, 1996; Vainrub and Pettitt, 2002; Bhanot 
et al., 2003; Held et al., 2003; Zhang et al., 2003; Halperin et al., 2004a, 2004b) as well as the analysis of the 
experimental results (Forman et al., 1998; Okahata et al., 1998; Steel et al., 1998; Georgiadis et al., 2000; Nelson et 
al., 2001; Dai et al., 2002; Kepler et al., 2002; Peterson et al., 2002; Hekstra et al., 2003). Within this model, the 
probes, irrespective of their hybridization state, do not interact. This assumption is justified when the probe density 
in the spots is sufficiently low. At higher probe densities interactions are no longer negligible and the Langmuir model 
requires modifications. As we shall discuss, the necessary modifications depend crucially on the length of the targets 
as characterized by TV, the number of bases or monomers. Importantly, in biology experiments the targets are usually 
significantly longer than the probes. As a result, each hybridized probe binds a long segment of single stranded nucleic 
acid formed by the unhybridized part of the target (Fig. 1). This leads to two effects. First, when the tails do not 
overlap the hybridization at an impenetrable surface incurs an entropic penalty. This reduces the equilibrium constant 
of hybridization with repect to its bulk value. Second, it is necessary to allow for the crowding of these unhybridized 
"tails" as the fraction of hybridized probes grows. This crowding gives rise to a polymer brush, a phenomenon that 
was extensively studied in polymer physics (Milner, 1991; Halperin et al., 1992; Riihe et al., 2004). The theory of 
polyelectrolyte brushes (Pincus, 1991; Borisov et al., 1991; Riihe et al., 2004), as modified to allow for target-probe 
interactions and wall effects, enables us to analyze the effects of the crowding on the thermodynamics and kinetics of 
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hybridization on DNA chips. In particular, we obtain the hybridization isotherm and the rate equation in the brush 
regime when the unhybridized tails overlap. As we shall see, the free energy penalty associated with the brush gives 
rise to distinctive modification of the Langmuir isotherm and kinetics. Importantly, the brush penalty reflects both 
the electrostatic interactions within the probe layer and the entropic price due to the extension of the crowded chains. 
It results in slower hybridization and lower attainable hybridization. 

Our analysis focuses on oligonucleotide microarrays hybridizing with long targets of single stranded (ss) DNA. 
For simplicity we limit the discussion to the experimentally attainable case of monodispersed targets and probes, a 
passivated surface that eliminates physical adsorption of DNA and probes anchored to the surface via short spacer 
chains. The qualitative features of our results apply however to a wider range of systems. Two hybridization regimes 
appear, depending on the equilibrium hybridization fraction, x eq , as set by the bulk concentration of the target, c t . 
For low x eq , the hybridization isotherm is of the Langmuir form, x eq /(l — x eq ) = c t K where K is the equilibrium 
constant of the hybridization reaction at the surface. For probes comprising n <C N bases K at an impenetrable 
surface is reduced by a factor of (n/A) 2 / 5 with respect to the equilibrium constant of the free chains in solution. At 
higher x eq , obtained at higher a, the effective equilibrium constant is modified because of the brush penalty. The 

leading correction to the hybridization isotherm is x eq /(l — x eq ) — c t K exp[— const' (xe q 3 — ig 3 )] where xb corresponds 
to the onset of brush formation. The formation of the brush does not affect the denaturation rate constant of the 
hybridized probe. However, it does lower the hybridization rate constant by a factor of cxp[— const 1 (a; 2 / 3 — x 2 J 3 )] 

where x is the instantaneous hybridization fraction. The proportionality constant scales with A/E 2 / 3 where S is the 
area per probe. 

To our knowledge, there has been no direct experimental study of the effects of brush formation on the hybridization 
isotherms and the hybridization rates. Yet, experimental evidence of brush effects has been reported. Guo et al. (1994) 
observed that the maximum attainable hybridization fraction is reached at higher So when N increases. Su et al. 
(2002) reported slower hybridization as N increases at fixed S . A similar effect was reported for RNA targets by 
Dai et al. (2002). Further support for the existence of the brush effect is lent by the wide spread use of sample 
fragmentation to achieve a lower average N (See for example: Rosenow et al., 2001; Affymetrix, 2004). 

The practical implications of our analysis concern three issues: the design of DNA chips, the sample preparation and 
the analysis of the data. The design of DNA chips currently reflects the view that an increase in the oligonucleotide 
density in a spot should increase the signal intensity and therefore the sensitivity (Pirrung, 2002). Certain limitations 
of this strategy, due to the increase of the DNA diameter upon hybridization and the resulting steric hindrance, has 
been long recognized (Southern et al., 1999). In marked distinction, our analysis highlights limitations due to the 
crowding of the long non-hybridized tails of the targets. Thus, in choosing S it is useful to bear in mind the anticipated 
N of the sample and its effect on the attainable hybridization. When So is fixed, our analysis provides guidelines 
for the sample preparation. In particular, the choice of N as determined by the PCR primers or the fragmentation 
procedure. Concerning data analysis, our discussion identifies possible sources of error when comparing spot intensities 
of samples with different N. These may occur because both the onset of saturation and the hybridization rate vary 
with N. In quantitative terms, our analysis yields two guidelines: Concerning equilibrium hybridization, it leads 
to a simple relationship between the area per probe, So, the number of bases in the probe, n, the number of bases 
in the target, N, and the attainable sensitivity as measured by C50 i.e., the target concentration resulting in 50% 
hybridization at the spot. Regarding the kinetics, it yields a simple criterion for the onset of slowdown due to the 
brush formation. 

Experiments using DNA chips involve many control parameters concerning the chip design, the sample preparation 
and the hybridization conditions. These are outlined in Design of Oligonucleotide Microarray Experiments together 
with a discussion of the resulting hybridization regimes and the choice of parameters used in our numerical calculations. 
Our analysis incorporates ingredients from the theory of polymer brushes. These are summarized in Background on 
Polymer Brushes. This section describes the Flory version of the Alexander model of brushes as applied to terminally 
anchored polyelectrolytes in aquaous solution of high ionic strength. The model is modified to incorporate the effect 
of an impenetrable grafting surface. This is important in order to ensure crossover to the mushroom regime, of non- 
overlapping tails, and to enable comparison of the hybridization constants at the surface and in the bulk. Since the 
hybridization site is typically situated within the target, each hybridized probe carries two unhybridized tails. The 
necessary modifications are also discussed. When brush formation is possible, the hybridized targets also interact with 
neighboring probes. The resulting free energy penalty, within the Flory approximation, is described in Target-Probe 
Interactions. The free energies associated with the brush and with the target-probe interactions enable us to obtain the 
equilibrium hybridization isotherms. The derivation is discussed in Brush Effects-Thermodynamics of Hybridization. 
The hybridization isotherms allow to quantify the sensitivity in terms of the corresponding C50. In turn, these yield 
design guidelines relating the sensitivity to n, N and So- Assuming, and later checking, that the hybridization rate 
at the surface is reaction controlled enables us to specify the hybridization and denaturation rate constants in the 
different regimes. The necessary background, on the hybridization kinetics in the bulk and the desorption dynamics 
out of a brush, as well as the resulting rate equations are discussed in Brush Effects-Kinetics of Hybridization. The 
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second virial coefficient, v, specifying the interactions between the charged monomers of polyelectrolytes in the high 
salt regime is discussed in Appendix A. Using this v, we recover our earlier results of the n — N case and discuss 
the comparison to the N 3> n scenario. In Appendix B we present an alternative derivation of our result for the 
hybridization contant in the low surface density regime. This utilizes exact results thus avoiding the approximations 
inherent to the "Alexander-Flory" approximation. 

DESIGN OF OLIGONUCLEOTIDE MICROARRAY EXPERIMENTS 

Oligonucleotide chip experiments vary widely in their design. A brief summary of the possible designs is necessary 
in order to delineate the range of applicability of our analysis and the different possible regimes. To this end it is 
helpful to distinguish between three groups of design parameters: The chip design, the sample characteristics and 
the hybridization conditions. The primary parameters in the chip design (Pirrung, 2002) are the area per probe, So, 
and the number of bases in the probe, n. n values in the range 10 to 30 are typical. In this context one should 
discriminate two approaches to the production of oligonucleotide chips. In one, the probes are synthesized in situ 
using photolithography. In the other, pre-synthesized oligonucleotides with functionalizcd end groups are delivered 
to the spot. In the first approach it is necessary to allow for the production of incomplete sequences leading to 
polydispersity in n (Forman et al., 1998). The reported probe densities within spots vary between 1.2 x 10 10 and 
4 x 10 13 probes per cm 2 corresponding to 2.5 x 10 2 A 2 < So < 8.3 x 10 5 A 2 . The chip characteristics also include the 
nature of the surface treatment used to minimize non-specific adsorption and of the spacer chains joining the probe 
to the anchoring functionality (length, charge, hydrophobicity, etc.). 

A key qualitative characteristic of the sample is the chemical nature of the targets (Graves, 1999; Lockhart and 
Winzeler, 2000; Heller, 2002). To begin, it is necessary to distinguish between DNA and RNA targets which differ in 
two respects: First, single stranded (ss) RNA exhibits pronounced secondary structure (loops, hairpins, etc.) which is 
largely absent in ssDNA. Second, the hybridization free energy of RNA-DNA complexes is higher than that of DNA- 
DNA ones. For DNA samples, it is further necessary to distinguish between samples of double stranded (ds) DNA, as 
obtained from symmetric PCR amplification, and ssDNA samples as obtained, for example, using Lambda exonuclease 
digestion. The hybridization isotherms of the two types of samples are different (Halperin et al., 2004a). The labelling 
of the targets can also affect the hybridization behavior (Naef and Magnasco, 2003). Our discussion concerns samples 
of ssDNA targets assuming ideal labels that do not interfere with the hybridization. It focuses on the role of two 
quantitative characteristics of the sample: the number of bases in the target, N, and the molar concentration of the 
target, c t . N is determined by the choice of primers used for the PCR amplification or by the fragmentation step 
in the sample preparation. Note that the products of the PCR are monodisperse while the fragmentation introduces 
polydispersity in the size of the targets. In this last case it is only possible to control the average size of the targets. 
Typical reported values for PCR products vary in the range 100 < N < 350. The average N resulting from the 
fragmentation procedure is not always specified but the range 50 < N < 200 is representative. It is useful to note 
another distinction between the two procedures. Targets produced by PCR often have the hybridization site situated 
roughly in the middle of the target. In the case of fragmented targets, the location of the hybridization site is no 
longer controlled. With regard to c t it is helpful to stress the distinction between bioanalytic experiments, utilizing 
DNA chips to interrogate biological samples (See for example: Prix et al., 2003), and physical chemistry experiments 
aiming to understand the function of DNA chips (See for example: Peterson et al., 2002). In biology experiments 
Ct is a priori unknown since it is set by the biological sample and its treatment. In marked contrast, in physical 
chemistry experiments the target concentration is imposed by the experimentalist as is the composition of the sample. 
In such experiments the target used is often identical in length to the probes, n = N. As noted earlier, our analysis 
is motivated by bioanalytical experiments where N ^> n. 

The hybridization conditions include the composition of the hybridization solution, the hybridization temperature, 
T, and the hybridization time, th- Typical hybridization temperatures vary over the range 30°C < T < 60°C 
depending on n and the GC fraction. The hybridization times also vary widely with typical values in the range of 
2ft < th < 16ft. In most cases the hybridization solution contains 1M of NaCl. 

Different hybridization regimes are possible, depending on the values of n, N and S . To distinguish these regimes, 
it is necessary to first specify the molecular length scales of ssDNA and dsDNA. These are well established for dsDNA 
(Cantor and Schimmel, 1980). In the range of parameters considered, dsDNA is a rod- like molecule with each base 
pair contributing 3AA to its length. The radius of dsDNA is 9.5A and its cross section area is 284A 2 . We will limit 
our analysis to So > 284A 2 in order to avoid discussion of steric hindrance to hybridization. The corresponding 
characteristics of ssDNA are not well established. A typical value of the monomer size is a = 6A (Smith et al., 1996; 
Strick et al., 2003). The cited values of the persistence length, l p , vary between l p = 7. 5 A and l p — 35 A (Mills et 
al., 1999). ssDNA is often described as a random coil though long range interactions are expected to give rise to 
swollen configurations (Turner, 2000). In the following we will consider ssDNA as a swollen coil characterized by its 
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Flory radius (Rubinstein and Colby, 2003). This choice is dictated by our treatment of the brush, where the Flory 
radius emerges as a natural length scale. Accordingly, an isolated unhybridizcd probe occupies a hemisphere of radius 
rp ~ n 3 ' 5 a while a terminally hybridized target occupies a hemisphere of radius Rp <~ (N — 

n )3/5 a ^ 7v 3 / 5 a. As we 

shall discuss, the unhybridized probes do not interact when rp < S . Similarly, when Rp < S there is no brush 
regime. It is thus possible to distinguish between three different scenarios. A Langmuir regime is expected when 
So > Rp > rp. Brush effects, with no interactions between the probes, will occur when rp < So < Rp. Finally, 
when So < rp < Rp both the brush effect and probe-probe interactions play a role. All three scenarios occur in the 
reported variety of DNA chips. 

In the following we consider the role of n, N and So in bioanalytical experiments. For brevity we focus on 
the simplest among the experimentally realistic situations. Thus, we consider monodispersed ssDNA targets and 
monodispersed oligonucleotide probes. This avoids complication due to unspecified polydispersity and to competitive 
bulk hybridization. It is convenient to concentrate on the rp < So < Rp case with N n. As we shall see, this 
makes for a simpler discussion of the brush effects. It also allows us to ignore small corrections due to probe-probe 
interactions. Finally, our analysis assumes DNA chips with a passivated surface and probes anchored to the surface 
via short, flexible spacer chains. 

Our analysis is concerned with the modifications of the hybridization isotherm and rate equations as So decreases 
from the Langmuir range, So > Rp > rp, into the brush regime, rp < So < Rp. To implement this program, it is 
helpful to identify a reference state. In the following we utilize a probe layer that approaches the bulk values for the 
hybridization rate and equilibrium constants. We argue that this is the case when the following conditions are satisfied. 
First, the surface is perfectly non-adsorbing to both ss and ds DNA. Under these conditions adsorbed states are not 
involved in the hybridization reaction and the two state approximation for the hybridization reaction is justified. 
Second, the probes are attached to the surface via long, flexible and neutral spacers. We argue that the effect of the 
surface diminishes as the length of the spacers increases. Note that the spacers modify two effects. One is the steric 
hindrance that occurs when the probes are directly attached to the surface. The other is the reduction in the number 
of accessible configurations in the vicinity of an impenetrable planar surface. Idealy, the reference state involves 
spacer chains that do not interact with either the probes and the targets. The third condition is that the distance 
between the anchored probes ensures zero probe-probe interaction energy, irrespective of their hybridization state. 
For this reference state, the equilibrium hybridization constant at the surface K pt approaches Kp t , the equilibrium 
hybridization constant for the bulk reaction between the free chains. Accordingly, the hybridization isotherm in the 
small spot limit, when the hybridization at the surface has a negligible effect on initial molar concentration of the 
target c t , is 

Y^T=K pt c t . (1) 

It is important to distinguish between K pt and 

*.°=^(-#)- < 2 » 

where AG° t is the molar standard hybridization free energy as obtained from the nearest neighbor model (SantaLucia 
and Hicks, 2004), T is the temperature and R — 1.987 cal.mol^ 1 .K~ x is the gas constant. First, K® t and AG° t as 
calculated from the nearest neighbor model are identical for all N > n + 2. They allow, at most, for the effect of two 
dangling ends. Second, this model incorporates only nearest neighbor interactions along the backbone of the chain. 
It thus assumes that the oligonucleotide adopts the configuration of an ideal random coil. In particular, AG° t does 
not account for excluded volume interactions between the monomers. In addition, AGp t clearly does not allow for 
the effect of the impenetrable wall or for the interactions between the hybridized targets or between them and the 
neighboring probes. These additional terms and their effect on the hybridization isotherm will be discussed in the 
following three sections. 

Our choice of the parameters used in the numerical calculations is based on two experimental systems. One, of 
Guo et al. (1994), utilized probes of length n = 15 with PCR produced targets of length N = 157 or 347. Both 
ssDNA and dsDNA were investigated, the area per probe was varied in the range 300A 2 < S < 3000A 2 and the 
hybridization was carried out at T = 30°C. The hybridization times varied with N being th = 2 — 3h for N — 157 and 
th = 6 — 8h for N = 347. Note that in this study some of the data corresponds to the So < rp < Rp regime where 
probe-probe interactions are not negligible. The second system is the Affymetrix GcncChip E. Coli Antisensc Genome 
Array (Affymetrix, 2004). In this case, probes of length n — 25 hybridize with fragmented, thus polydispersed, ds 
cDNA targets with average length in the range 50 < N < 200. The hybridization is carried out at T = 45° G for 
th = 16/j. A rough approximation of S for Affymetrix chips was obtained from the estimated density of functional 



5 



FIG. 1: A schematic picture of the hybridization oflong targets at a layer of short probes. For simplicity we depict the case 
of targets with a terminal hybridization site, when each hybridized probe carries a long ssDNA tail. Three regimes occur: a) 

1/2 

In the 1 : 1 regime the distance between the probes, E , is large and each hybridized target can only interact with its own 
probe. There is no crowding of the tails, b) In the 1 : q regime the probe density is higher. At low hybridization fraction 
each target interacts with q — RpfEo probes, c) As the hybridization fraction increases the hybridized targets begin to crowd 
each other thus forming a brush with an area per chain R 2 F > E > Eo. Note that in the general case the hybridization site is 
situated roughly in the middle of the target and each hybridized probe carries two tails (d). 



groups in the substrate prior to the synthesis of the probes: 27 pM / cm 2 , and the step-wise yield of the synthesis, 
~ 90%. Only lApM/cm 2 attain n > 6 (Forman et al, 1998). This estimate yields E > 1200i 2 . In both systems 
the hybridization was carried out in a solution containing IM of NaCl. The base sequence of the probes considered 
in the calculations and their thermodynamic parameters for hybridization, as calculated using the nearest neighbor 
model with a perfectly matched target (SantaLucia, 1998; Peyret et al., 1999; HyTher™), are specified in Table 1. 

Table 1: The thermodynamic parameters utilized in the numerical calculations correspond to two probes: (i) 
The n = 15 wild type probe pi (5' - CGTCCTCTTC AAGAA - 3') incorporates the codon 406 of exon 4 
of the human tyrosinase gene. The N = 157 and 347 targets incorporate the perfect complementary seg- 
ment 5' - TTCTTGAAGAGGACG - 3' (Guo et al., 1994). (ii) The Affymetrix E.Coli Antisense n = 25 
probe p2 annotated AFFX-BioB-5_at:242:77, with interrogation point 177, corresponds to the sequence 5' — 
AGATTGCAAATACTGCCCGCAAACG - 3'. The fragmented cDNA targets incorporate the perfect complemen- 
tary sequence 5' - CGTTTGCGGGC AGT ATTTGC AATCT - 3'. The parameters are calculated from the nearest 
neighbor model (SantaLucia, 1998; Peyret et al., 1999; HyTher™) using the HyTher™ program with a 1M NaCl 
salt concentration. Since the targets are longer than the probes two dangling ends are invoked. 

probe Afl£ ASj t _ AG° pt (30°C) AG° t (45°C) 

kcal.mol 1 cal.mol .K 1 kcal.mol 1 kcal.mol 1 
pi -121.00 -334.06 -19.73 -14.72 
P 2 -203.30 -546.32 -37.69 -29.49 



BACKGROUND ON POLYMER BRUSHES 



Polymer brushes are formed by chains with one monomer anchored to a planar surface (Milner, 1991; Halperin 
et al., 1992). In the simplest case, the anchoring moiety is the terminal monomer. When the area per chain, S, 
is large the chains do not crowd each other. In this "mushroom" regime, the chains may be roughly considered as 
occupying hemispheres whose radius is comparable to the Flory radius of the free chain, Rf- When the surface density 
increases such that £ < R F , the chains begin to crowd each other thus forming a "brush". In the brush regime the 
chains stretch out along the normal to the surface so as to decrease the monomer concentration, c, and the number of 
repulsive monomer-monomer contacts. A simple description that captures the leading behavior of brushes is provided 
by the Alexander model (Alexander, 1977; Milner, 1991; Halperin et al., 1992). Within it the concentration profile 
of the brush is modeled by a step function of height H i.e., c = N / HYj at altitudes up to H above the surface and 
c = for higher altitudes. All the free ends arc assumed to straddle the outer boundary of the brush at height H . In 
the following we will use the Flory-style version of the model, ignoring scaling corrections. The regime of validity of 
this meanfield approach for semiflexible chains is expanded in comparison to that of flexible polymers (Birshtcin and 
Zhulina, 1984). This justifies the use of the "Alexander-Flory" model where the free energy per chain in a brush is 

G N 2 H 2 H 

— =v 1 In (3) 

kT Y,H Nalp Na y ' 

where k is the Boltzmann constant. The first term allows for the monomer-monomer interactions. It is of the form 
vc 2 V c hain where v is the second virial coefficient and V c hain — is the volume per chain. The second accounts 
for the entropy loss incurred because of the stretching of a Gaussian chain, comprising of Na/l p persistent sequences 
of length l p , along the normal to the surface. Here a is the monomer size, l p is the persistence length of the chain 
and the span of the Gaussian unswollen coil is Ro — -J 'Nal p . The last term arises because the impenetrable surface 
carrying the anchoring site reduces the number of accessible configurations of the tethered chain. For a Gaussian 
chain with a free end at altitude H the number is reduced by a factor of HI p /Rq (DiMarzio, 1965). This contribution 
is often ignored because it has a negligible effect on the equilibrium dimensions of the chains. It leads however to a 
significant modification of the hybridization constant at the surface. The last two terms of Eq|3] apply, in this form, 
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when Na 3> l p . We have omitted a term allowing for the entropy associated with the placement of the free end. This 
is because the Alexander model assumes that all free ends are constrained to the brush boundary. For simplicity we 
ignore, here and in the following, numerical factors of order unity. Minimization of G with respect to H yields the 
equilibrium values of Gbrush and H 
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kT 
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In the mushroom regime, the chains occupy a hemisphere of radius 



a \ a I \a A J 



Accordingly, the free energy per chain in the mushroom regime, G mus h , is set by the requirement G mus h 
the mushroom-brush boundary when £ = R F and H = Rp thus leading to 
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As noted earlier, the properties of the chains in the mushroom regime are comparable to those of free coils. In turn, 
the free coil behavior is specified by the free energy (De Gennes, 1979) 
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kT 



N 2 
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(8) 



leading, upon minimization with respect to the radius r, to Rp as given by Eq^Jand to the equilibrium free energy 
of a coil 



G, 



kT 



/ \ 3/5 



V \2/5 



(9) 



The difference between G mU sh and G co a is due to the logarithmic correction — hi(Rp/Na) arising from the wall effect. 

Within the approach described above, the nature of the grafted chain is specified by three parameters, the monomer 
size a, the persistence length l p , and the second virial coefficient associated with monomer-monomer interactions v. 
For the case of a brush formed by polyelectrolyte chains in aqueous solution of high ionic strength, "high salt" , v can 
be approximated by (Pincus, 1991; Appendix A) 



2tt 



1 



T 



2ttIbt 2 d . 



(10) 



The first term allows for the hard core repulsion between the monomers and for a weak, long ranged, van der Waals 
attraction between them. Here is the theta temperature where v of a neutral chain vanishes thus leading to 
the behavior of an ideal Gaussian coil. This term by itself is used to describe the behavior of neutral polymers 
(Rubinstein and Colby, 2003). The second term arises from the screened electrostatic interactions between the singly 
charged monomers. Here Ib = e 2 /ekT is the Bjerrum length (Evans and Wennerstrom, 1994) where e is the dielectric 
constant, k the Boltzmann constant and T the temperature. In water, with e ~ 80 at room temperature, Ib — 7 A. 
Note that the variation of e with T contributes to the T dependence of Ib- The Debye length rp, characterizes the 
range of the screened electrostatic interactions in a salt solution (Evans and Wennerstrom 1994). For a 1 : 1 salt with 
number concentration of ions c s , rp, = 1/\/8ttIbc s thus, in a 1M solution rp = 3 A. In our model, the presence of 
the 2irlBrp, term in v distinguishes polyelectrolyte brushes from neutral ones. It is important to stress the limitations 
of approximating v by Ea llOl It corresponds to the interaction between individual charged spherical monomers. For 



cylindrical non-charged monomers v ~ rather than 



(Rubinstein and Colby, 2003). Furthermore, this 



description does not allow for the contribution of Hydrogen bonds with water nor for the effect of correlations on 
the electrostatic interactions. Finally, the appropriate 9 temperature remains to be determined. With these caveats 
in mind, the second term is roughly comparable to 2-7ra 3 /3 and should be dominant for T > 9. As a result v is 
comparable to 2na 3 /3 and the swelling behavior of the chain is similar to that of a neutral chain in an athermal 
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solvent (De Gennes, 1979). In other words, even short chains swell to their Flory radius. We should add that by using 
v ~ 2irlBr 2 D we are able to recover our earlier results (Halpcrin ct al., 2004a) for the case of n = N (Appendix A). 

In the Flory type approach, described above, the equilibrium state is determined by a global balance of the osmotic 
pressure of the monomers and the restoring elastic force of the stretched Gaussian chains. A more refined analysis of 
the brushes, utilizing self consistent field (SCF) theory, is possible. This avoids the assumptions of uniform stretching 
and step- like concentration profiles. It yields the same functional forms for the characteristic height, H, and for Gbrush 
but with somewhat different numerical prefactors. With these reservations in mind we utilize the simplest approach, 
described earlier, because it typically yields the correct leading behavior in similar systems. A SCF theory is necessary 
for the description of effects that depend strongly on the details of the concentration profile and the distribution of 
the free ends. 

Our discussion thus far concerned brushes anchored to the surface by the terminal head group. In DNA chips 
the situation is often different in that the hybridization site, the anchoring functionality, is located roughly at the 
middle of the target. As a result, each hybridized probe carries two unhybridized tails (Fig. Id) of length Ni and 
N 2 = Ni(l + a) such that N± + N 2 + n = N. In considering the effect of this feature note that, in the brush regime, 
the details of the anchoring functionality are screened with a distance E 1 / 2 from the surface. As a result, it is possible 
to estimate the modification of Gbrush and H in two cases, Aq — N 2 S> n and N 2 ^> Aq » n. When Aq — N 2 the 
resulting brush is similar to that formed by chains of length N/2 but with an area per chain E/2. In this case Gbrush 
is larger by a factor 2 2 / 3 ~ 1.6 while H is smaller by a factor 2 2 / 3 in comparison to the values found for a brush of 
terminally anchored chains of length N and area per chain E. In the limit of N 2 ^> Aq ^> n the resulting brush may 
be considered as bidispersed, comprising an equal number of chains of length Aq and N 2 . Such a bidispersed brush 
can be described as a superposition of two brushes (Birshtein et al., 1990). A simple two layer model incorporates 
an inner brush of chains of length Aq and area per chain of E/2 and an outer brush formed by chains of length 
N 2 — Aq — aNi and area per chain E at the distal boundary of the inner brush. Within the Flory approximation this 

scheme leads to Gbrush — Gbrush and H = a + 2 2 ' H where Gbrush and H correspond to a monodispersed brush 

of chains of length N with an area per chain E. Note that a — corresponds to Aq = N 2 while a> 1 to JV 2 > JV], 
In both cases, the effect is to modify Gbrush and H, as obtained earlier by a multiplicative factor of order unity. In 
keeping with our policy we will omit these numerical factors in the interest of simplicity. 



The preceding discussion of brushes allows for the interactions among the hybridized targets and the effects of 
the impenetrable wall. However, the brush regime is only attainable when the hybridized targets can interact with 
neighboring probes, thus giving rise to an additional contribution to the free energy of the system. In discussing 
the target-probe interactions it is useful to distinguish between three regimes. When E > R F > r F the hybridized 
targets can not crowd each other. Roughly speeking, each one may be considered to occupy a hemisphere of radius 
Rf containing a single probe that is hybridized to the target (Fig. la). Since each target interacts with a single probe 
we will refer to this regime as 1 : 1. Our principle interest is in the two regimes that occur when Rp > E > r F . 
When the hybridization degree x is sufficiently small each target will occupy, as before, a hemisphere of radius Rf- 
However, it will now interact with q = R f /T,q probes (Fig. lb). We will thus refer to this regime as 1 : q. Note that 
in the polymer science nomenclature both 1 : 1 and 1 : q regimes fall into the "mushroom" range, when the tethered 
chains do not overlap. The brush threshold occurs at x — xb when the hemispheres occupied by the different targets 
come into grazing contact. For a surface of total area At the area per hybridized target is E = At/xNt = So/a; 
where Nt is the total number of probes, xb corresponds to E = Rp or xb = 'So/Rp = 1/q. When x exceeds xb the 
hybridized targets begin to overlap thus forming a brush (Fig.lc). Since the area per chain in this regime decreases 
as E = E /x the target experiences interactions only with x^ 1 < q probes. 

To estimate the free energy of interactions between the target and the probes, in the spirit of the Flory approach, we 
assume that each probe contributes an interaction free energy Gi n t/kT = vnc. Here c is the monomer concentration 
within the monomer cloud formed by the hybridized targets i.e., we assume the interaction with the probes does not 
affect c as obtained in our earlier discussion of the mushroom and brush regimes. As we shall elaborate later, this 
assumption is justified only when Gi nt <C G co u(N) or 



TARGET-PROBE INTERACTIONS 




(11) 



In the 1 : 1 regime each hybridized target occupies a hemisphere of radius Rf incorporating a single probe. Accordingly 
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G\£ t /kT = vnc with c = N/R% thus leading to 



kT N 4 / 5 \l 



(?) ■ < 12 ) 



This estimate is reasonable when N ^> n such that the region occupied by the unhybridized target is sufficiently large 
so as to encompass the hybridized probe. Roughly speaking, this implies (N — n) 3 ^ 5 a > 3.4 nA. Within the 1 : q 
regime each hybridized target interacts with q — RpfEo probes. Accordingly G^/kT = v(Rp/T, )nc with c = N/Rp 
or 



kT lEo 



G\^ t and G\^ t are independent of x. In marked contrast Gf nt , accounting for the target-probe interactions in the 
brush regime, varies with x. This variation arises because of the x dependence of the monomer concentration within 
the brush, Cb rU sh — N/HH where £ ~ 1/x and H ~ a; 1 / 3 . Gbrush(x) is obtained from Eq^upon replacing £ by T,q/x. 
Within the Flory approach the total interaction free energy between the targets and the probes is vNxncbrush- The 
interaction free energy per hybridized target is thus vncbrush / x or 

Gf nt vnN 1/3 / a 2 \ 2/3 / a \ 1/3 / v \ 2/3 

— — = = nx^ i/d 

kT Z H 



k) (i) (?) ' <"> 



The condition Ea llll ensures that the interaction term G^L is a weak perturbation to the Flory free energy of 
the mushroom G mus h(N). When this requirement is not satisfied the chain span exceeds the Flory radius. This is 
an unphysical result since the interactions driving the extra swelling are confined to the surface. In this case the 
chain can no longer be assumed to occupy a hemispherical region encompassing the probes. The uniform monomeric 
distribution inherent to the Flory approach should be refined so as to reflect locally stretched configurations allowing 
to avoid the probes. For simplicity we will not consider this regime. 



BRUSH EFFECTS-THERMODYNAMICS OF HYBRIDIZATION 



Having obtained the free energy terms associated with target-target and target-probe interactions at the surface, 
we are in a position to investigate their effect on the hybridization isotherm. To simplify the equations we set v = a 3 
and l p = a. The hybridization isotherm is determined by the equilibrium condition of the hybridization reaction 
p + 1 ^ pt at the probe layer that is /x pt = fJt p + fJ-t where fii is the chemical potential of species i. Here p and t signify 
single stranded probe and target while pt is the hybridized probe-target pair. We first consider /i t . In practice, the 
molar concentration of the targets, Ct, is only weakly diminished by the hybridization reaction and it is reasonable 
to assume that Ct is constant. The generalization to the opposite case, when this small spot approximation fails, is 
straightforward (Halperin et al., 2004a). Since the target solution is dilute and the ionic strength of the solution is 
high, electrostatic interactions between the targets are screened. Consequently fi t assumes the weak solution form 

IH = LL° t +N Av Gcoti(N) +RT In c t (15) 

where /i° is the chemical potential of the standard state of the hybridization site and Na v is the Avogadro number. 
We choose a standard state such that /i° t — /i° — = AG° t as given by the nearest neighbor method. As discussed 
earlier, this implies a standard state having an ideal coil configuration. When the hybridization site is within the 
target, it also reflects the contribution of two dangling ends. G co u(N), as given by Eq[7| allows for the swelling of 
the free coil due to excluded volume and electrostatic interactions. Strictly speaking, fit = fx® + NavG co u + RT In at 
where at is the activity (Moore, 1972). The dimensionless a t is related to the molar concentration of targets Ct via 
at = JCt where 7 is the activity coefficient. Since 7 — > 1 as Ct — > we will, for simplicity, express /it by Eg 1 151 noting 
that the molar ct in this expression is dimensionless. 

It is useful to first specify K pt of the reference state corresponding, as discussed in Design of Oligonucleotide 
Microarray Experiments, to Kp t of the bulk reaction p + t f± pt where p denotes a free probe chain. To this end we 
need 

= Mp + N Av G col i (n) + RT In c ¥ (16) 

and 

fipt = /x° t + N Av [G coU {N - n) + da] + i?Tln C p t . (17) 
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The cquilibium condition \i t + \Xp = (j,p t yields Kp t = K pt with 



K 



pt 



exp < - 



AG° pt + N Av [G coll (N -n) + Gj* - G coll {n) - G coll {N)] 



RT 



(18) 



,1/5 _ 



]V4/5 



where K pt = exp(—AG pt /RT) and AG° 4 = fi pt — n p — K pt > K pt because the hybridization results in the 
formation of a rodlike ds domain whose monomers experience only short-range interactions with each other but also 
long-range interactions with the monomers of the unhybridized ss tails. 

The chemical potentials fj, p t and \x p are specified by the free energy per probe site of the surface, ^ S ite- In the 1 : 1 
regime, when S > R 2 F > r F , there is no mutual interaction between the probes or between the targets. The molar 
free energy of probe sites is 



Isite = 7o + x [ult + N Av G mush (N - n) + N Av G£ t ] +(l-x) [fjP p + N av G mush {n)} - TS[x\. 



(19) 



Here 70 is the free energy density of the bare surface while fi pt and /j p denote the chemical potentials of the hybridized 
and non-hybridized probes in the standard state. As noted before, the standard state of p is an ideal coil with 
no excluded volume interactions. The two G mus h terms allow for the excluded volume and screened electrostatic 
interactions as well as for the effect of the impenetrable wall. G mus h{N — n) accounts for the monomer- monomer 
interactions of the unhybridized tail of pt while G mU sh{n) allows for the contribution of the unhybridized probe. 
G\£ t reflects the electrostatic and excluded volume interactions between the hybridized target and its own probe. The 
mixing entropy per mole of p and pi sites is S[x] = — i?[xlnx+(l — x) ln(l— a;)]. The equilibrium condition (x pt = Hp+Ht 
can be expressed in terms of the exchange chemical potential of the hybridized probe, pf pt = fi p t — ji p = dj S it e /dx, as 
fi p t = [i t - The hybridization isotherm, thus obtained, assumes the familiar Langmuir form 



ctKp-'t 1 = c t K° t exp 
2/5 



G mush (N -n) + G}£ - G mush (n) - G coil {N) 



kT 



(20) 



n y 

n) 



Kpj. 1 is smaller than K pt 



because of the effect of an impenetrable wall giving rise to the (n/N) 2 ^ 5 factor reflecting the 
reduction in the number of configurations available to the unhybridized tail of pt. 



In the S > 



R F 



> ri 



range the hybridization behavior is independent of x. As noted earlier, an x dependence is 



expected when R F > £0 > r F . We first discuss the 1 : q regime occurring when x < xb- Jsite in this range is similar 
to the one describing the 1 : 1 regime. The only difference is the replacement of G\^ t by G^, thus allowing for the 
interactions between a hybridized target and q > 1 probes. The hybridization isotherm as obtained from ^ = fj, t is 



c tK p t q = c*^° t exp 



G mus h(N -n) + G\£ t - G m ush(n) - G coil (N) 



kT 



(21) 



CtK pt (£) 



2/5 



exp 



7V4/5 



(9-1) 



As in the 1 : 1 regime, the hybridization isotherm is of the Langmuir form. The equilibrium constant, K pt 9 is however 
smaller than K p ^ because G in ' is larger than G\£ t by a factor of q = R F /T, = N 6 ^ 5 a 2 /E . 

When S < R 2 F or x > xb = T.q/R 2 f ~ N~ 6 ^ 5 Y, /a 2 the hybridized targets begin to crowd each other and form a 



brush. This crossover occurs at x eq — xb corresponding to 

So 1 S 1 



CB = 



Ri 



V.q 



Rl - S K. 



N 

P t \ n 



2/5 



exp 



(9-1) 



U F ^» "pt F 

jsite of the brush regime, 

l sl te = 70 + x\pP pt + N Av G b rush(x) + N av G? nt (x)} + (1 - x)\nl + N Av G mush (n)} - TS[x], 



(22) 



(23) 



is distinctive in two respects. First, G mus h{N — n) is replaced by an x dependent free energy of a chain in a 
brush, Gbrush(x). Second, the term allowing for the target-probe interactions, Gf nt (x), is also a function of x. The 
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hybridization isotherm, obtained as before, is 



x 



= c t KE.(x eq ) = c t K°e^p 



l — x e 



(n)-G coil (N) 

kT 



(24) 



* c t K pt (^) 1/3 ex P { ^ - «3 - ^ + n.-V] (|) ^ j . 

The TV 1 / 5 term, arising from G co u(N) is expressed as N x 7 ^ 3 (a? /Hq) 2 / 3 to underline the crossover behavior at xb- By 
construction, this isotherm is only meaningful when q > cb so that x > xb- It deviates strongly from the Langmuir 
form because of the x dependence of Gb rU sh and Gf nt . 

The complete "long tail" hybridization isotherm for the r 2 F < Eo < Rp case is obtained from Ea 12 II and Eal24l In 
this isotherm, as in the interaction free Langmuir isotherm (EqJQ, x eq — > 1 as c t increases. However, the two scenario 
differ strongly with respect to the range of Ct involved (Fig. 2). The saturation in the long tail case occurs at a much 
higher ct- When x eq vs ct curves of the two scenario are compared over a limited ct range (Fig. 2a), the long tail 
isotherm is superficially similar to a Langmuir isotherm but with apparent saturation at x eq < 1. A plot of x eq vs 
log Ct (Fig. 2b) is necessary in order to visualize the differences in the saturation behavior. 

A useful measure of the sensitivity of the DNA chip is the C50 corresponding to the target concentration, ct, 
needed to obtain at equilibrium x eq — 1/2 (Halperin et al., 2004a). The C50 also provides a rough estimate for the 
onset of saturation, as discussed earlier. In the 1 : 1 regime, where the hybridization follows a Langmuir isotherm, 
c lo = 1/Kpt~- When R 2 F > £0 > rp, we can distinguish between two scenarios. So long as x B = ^ /R 2 F > 1/2, 
x eq — 1/2 is attained before the onset of the brush and c\q = 1/K^ . In the opposite case, xb = T, /R F < 1/2, 
x eq — 1/2 occurs in the brush regime and cf = 1/Kp t (x eq = 1/2). These corresponding experimental guidelines 
assume a more useful form when considering the logarithm of C50. In particular, these relate the range of expected 
target concentrations Ct, as given by or cf 0) to AG pt , n, N and So 



50 ul L 50i bu "^pti 

Ar<o 

l:q _ 



l n 4* = ^ + H m ^ +iV 2 / 5 n— -n 1 ' 5 , (25) 
50 RT 5 n E V ' 



^ 5 o can be significantly higher than Cgg 9 , 

In ^ = m - 2 2/3 4 /3 ) + M ( £-) - N^nf + ± In » 1, (27) 



since it is dominated by the factor exp[7V(l — 2 2 / 3 a; 2 / 3 )(a 2 /2Eo) 2 / 3 ]. It is helpful to compare Eal2*Kland Ea l2"fil with 
the Langmuir isotherm of the "reference" state, Eq^ where Cg = 1/K pt . The guideline obtained, following the same 
procedure, is 

mc O o = ^k + » _„l/6. (28) 

RT iV 4 / 5 

In this case Cg is determined by n, N and AG pt /RT. In marked contrast and cf Q depends explicitely on So- 

The strong N dependence of cf , as compared to c\q and c® , is illustrated in Fig. 3. The increase of c^j signals a 
corresponding loss of sensitivity. 

To utilize these guidelines one needs AG pt as calculated using the nearest neighbor model. However, to highlight 
the role of n as a design parameter, it is helpful to use the Wetmur approximation (Wetmur, 1991) where average 
values of the nearest neighbor contributions are utilized. Accordingly, AG pt of perfectly matched probe-target pair, 
when the hybridization site is located within the target, is approximated by 

AG° pt = (n - l)AG nn + AG, + 2AG e (29) 

where AG nn , AG; and AG e are the average values corresponding to a nearest neighbor pair, an initiation step and 
a dangling end. Wetmur estimated the nearest neighbor contribution by AH nn = — 8.0 kcal.mol~ x and AS nn — 
— 21.5 cal.mol -1 .K -1 , the initiation term by a temperature independent AGj = 2.2 kcal.mol -1 and the dangling end 
contribution by AH e = —8.0 kcal.mol -1 and AS e = — 23.5 cal.mol -1 .K~ 1 . Note that while useful, the Wetmur 
approximation erronously predicts identical AG^ t for all pt pairs with N = n. 




FIG. 2: The hybridization isotherms as calculated using Eg 12 II and Eg 1241 for the probe target pairs utilized by Guo et al.(1994) 

with E = 2500i 2 and T = 30°C. N = 157 ( ), N = 347 ( ) and the reference state case calculated from EqE]with 

N — 147 ( ■ ■ ■ )• The x eq vs c t curves are depicted in a) for the range < ct < lpM while x eq vs log c t plots are depicted in b) 
(—9 corresponds to nM). 



BRUSH EFFECTS-KINETICS OF HYBRIDIZATION 

Having obtained the equilibrium constants K^'j. 1 , Kp t q and K^ t {x) for the hybridization at the surface we are now in 
a position to consider the corresponding rate constants. To this end we will assume, and later confirm, that the rate 
is reaction controlled. Again, for simplicity, we set numerical prefactors to unity, v — a 3 and l p — a. It is necessary 
to recall first the relevant features of the kinetics of oligonucleotide hybridization and of the desorption of polymers 
out of a brush. 

As discussed in Design of Oligonucleotide Microarray Experiments, the reference state of our analysis is a layer 
of non-interacting probes bound to a passivated surface by long flexible spacers. We assume that the molecular 
mechanism of hybridization in this case is identical to the bulk one and that the kinetics follow the Langmuir rate 
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FIG. 3: Plots of log eg, vs N for the probes utilized by Guo et al. (1994) with E = 2500i 2 ( ) and S = 5000i 2 ( ). 

T = 30°C and n = 15. The reference state logcjjo is plotted for comparison ( • ■ ■ ). The circles correspond to the crossover 
between 1 : 1 and 1 : q regimes whereas squares correspond to the crossover between 1 : q and B regimes. 



law 

dx 

— = k h c t (l - x) - k d x. (30) 

In this regime the hybridization and denaturation rate constants, kh and kd, are independent of So or x and approach 
their bulk values. At equilibrium % = leading to K pt = kh/kd as required by detailed balance. In turn, the 
hybridization mechanism of free oligonucleotides in solution is thought to involve the steps outlined below (Craig et 
al., 1971; Porschkc and Eigen, 1971; Cantor and Schimmel, 1980; Turner, 2000). An approach and alignment of the 
single stranded oligonucleotides is followed by the hybridization of a single base pair. A stable nucleus, comprising 
of n c + 1 base pairs, is formed by step-wise addition of hybridized pairs. Importantly, a ds sequence of n < n c is 
unstable. Once n c + l is attained the ds domain is rapidly "zipped up". For oligonucleotides comprising GC base pairs 
n c ~ 2 — 3 and the hybridization rate constant exhibits the form kh — t^ 1 exp[— AG^/RT]. Here Th is a molecular 
time scale characterizing the formation of the last base pair of the nucleus while the activation free energy AGf 
reflects the formation of a ds nucleus of n c base pairs plus the activation free energy for adding the next base pair. 
Importantly, the reaction is not diffusion controlled but involves a number of activation barriers associated with a 
corrugated free energy profile (Turner, 2000). A rough estimate of AGjf within the Wetmur approximation (Wetmur, 

1991) yields AGf ~ n c AG nn + AGi + 2AG e indicating that AG^ depends on n c rather than n. This last point 
rationalizes a phenomenological result we will utilize later, namely kh in high ionic strength solutions is 

k h ~ H^M -1 .*- 1 (31) 

to within one order of magnitude and with a weak T dependence (Turner, 2000). This, together with the detailed 
balance requirement K pt = kh/kd yields 

k d ~ 10 6 exp[AG° pt /RT] s" 1 . (32) 

In terms of the Wetmur approximation k d is expressed as k d ~ t^ 1 exp[(n — n c )AG nn /RT]. The activation barrier 
for denaturation involves thus the break up of n — n c base pairs so as to form an unstable ds domain. Importantly, 
for 15 < n < 25, the denaturation life time at 37°G is measured in years. 

At this point it is of interest to comment on a result, obtained from computer simulations, concerning the kinetics 
of desorption out of a brush (Wittmer et al., 1994). It concerns a planar brush formed from flexible and neutral chains 
whose terminal monomer experience a short range attraction to the wall. The attraction was modeled as a well of 
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width a, a monomer size, and depth G we u. In this system the expulsion rate constant is 

k out =T- 1 (Z)cxp[-G weU /RT} 



(33) 



where r(E) is the time required by the head group to diffuse across a distance E 1 / 2 , corresponding to the inner most 
blob of the brush. Importantly, k out while E dependent was found to be independent of TV. Once the surface bond 
is broken, the expulsion of the chain out of the brush is driven by repulsive monomer-monomer interactions with 
neighboring chains. This last stage is a fast process and thus not rate controlling. The system studied by Wittmer et 
al. differs from ours in two respects. First, in this study the attractive potential is laterally invarient i.e., the surface 
is uniformly attractive. As a result, the reaction coordinate is the distance between the terminal end group and the 
surface z. In our case the attractive potential is localized at the immediate vicinity of the probe and the early steps 
of denaturation involve lateral separation of the two strands. Consequently the reaction coordinate at the vicinity 
of the surface is no longer z. Second, in the work of Wittmer et al., the barrier to adsorption is due to the brush. 
There is no barrier in the mushroom regime where the reaction is diffusion controlled. This is also the case in the 
brush regime when the terminal group resides within a distance E 1 / 2 from the surface. However, as noted earlier the 
hybridization reaction in the bulk is not diffusion controlled. Accordingly, one should consider the possibility that 
the rate of hybridization at the surface is similarly not controlled by diffusion. In such a case the denaturation rate 
constant, corresponding to k ou t, will be independent of both TV and E. 

In the following we will assume, and later confirm, that the rate of hybridization at the surface is reaction controlled 
rather than diffusion controlled. In quantitative terms, the assumption of reaction control involves two ingredients. 
First, the rate equation may be written as 



dx 



k h c t (z = 0)(1 - x) - k d x 



(34) 



where c t (z — 0) is the local concentration of target hybridization sites at the surface while kh and kd stand for the 
rate constants as observed in the solution. In microscopic terms this implies that the hybridization and denaturation 
reactions at the surface are respectively monomolecular and bimolecular and that the encounter probability between 
a probe and a target is proportional to c t (z = 0). Importantly it also implies that the free energy surfaces of the 
hybridization reaction in the bulk and at the surface arc identical. This last point is reasonable because this free energy 
surface reflects local reorganization of hydrogen bonds and stacking interactions (Turner, 2000). This assumption also 
implies that the lateral diffusion is fast enough so as to prevent inplane variation of c t (z — 0). The second ingredient 
is the assumption that, for any x, c t (z = 0) is equal to c\ (x), the equilibrium concentration of unhybridized terminal 
groups at the surface. In other words, the diffusion of chains is sufficiently fast in comparison to the hybridization 
reaction to ensure that a Boltzmann distribution is maintained. This condition is especially stringent in the brush 
regime, where inbound diffusion must overcome a potential barrier due to interactions with the previously tethered 
chains. The equilibrium condition requires that c\jct = exp(— Afi/RT) where Afi(x) is the difference between the 
chemical potential of a fully inserted chain and a free one. Accordingly, for each of the three regimes 



K pt 



i = l:l,l:q,B. 



(35) 



Note that within our treatment numerical prefactors are omitted and there is no distinction between the chemical 
potential and the free energy per chain. Altogether, the corresponding rate constants for the three regimes, i = 1 : 1, 
1 : q and B are 



kh 



K 



and 



(36) 



leading to 
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(9-1) 
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(37) 

(38) 
(39) 



The results above where obtained assuming that the hybridization rate is controlled by the reaction rather than by 
the diffusion towards the surface. To check the consistency of this approach we consider the corresponding Damkohler 
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number (Blanch and Clark, 1996) Da = Jreac/ Jdif ■ Here J reac and Jdif are the maximal fluxes associated with the 
reaction and the inbound diffusion, assuming reaction control. Reaction control implies Da <C 1. Jreac = fc/iCj/So 
is an upper bound on the reaction flux. The inbound flux of chain through the brush is Jdif — c\vbarrier where 
^barrier is the diffusion velocity of a single chain at the vicinity of the surface where the brush potential is essentially 
flat. Recent experimental results and a unified picture of theoretical models are presented by Titmuss et al. (2004). 
Altogether 

Da = kh (40) 

^O^barrier 

where Vbarrier = cekT ' / 'i]N a 2 . Here 77 is the solvent viscosity and a is a polymer specific numerical constant, a of ssDNA 
has not yet been determined but for flexible synthetic polymer a ~ 0.1. For water at 25°C rj = 0.89 x 10~ 3 N.m~ 2 .s. 
The Damkohler number at 25°C, when both fluxes are expressed in units of chains. m .s is 

N 

Da = 0.13 — (41) 
So 

where we assumed a = 0.1, k h = 10 6 M _1 .s -1 , a = 6A and expressed S in A 2 . For 100 < N < 600 and T = 25°C, 
the Damkohler number varies in the range 9 x 10~ 3 < Da < 5 x 10~ 2 when S = 1500^4 2 and 2.6 x 10~ 2 < Da < 0.16 
when So = 500^4 2 . The variation of water viscosity with temperature affects those ranges by at most a factor 2 for 
0°C < T < 70°C. Accordingly, the assumption of reaction control of the hybridization rate is justified for typical 
values of N and So. It will though fail evantualy for high N values. One should note that the issue of reaction vs 
diffusion also arise when the hybridization chamber is agitated and we will not discuss it further. 

As required the rate constants Eg 1371391 obey detailed balance and exhibit the proper crossover behavior. In 
particular, k' l h /kd = K l pt as well as k^(xs) — k]' q . The x dependence of k^ slows down the adsorption rate (Fig. 4). 
kh(xco) — k n /e is a possible measure for the onset of significant slow down. In the limit of N 3> 2n the x -1 / 3 term 
is negligible and the onset occurs roughly at 



1 /"E 



2/3 



+Xb 



-| 3/2 

* xb- (42) 



It thus affects the whole brush regime. The slower kinetics in the brush regime can affect the attained hybridization 
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FIG. 5: The hybridization fraction attained after th = 16 hours as a function of N for the Afnmetrix probe p2, n — 25, 
T = 45°C, c t = 0.1 nM with E = 2500A 2 ( ) and S = 5000i 2 (- - -). 

even after long hybridization periods (Fig. 5). This is of practical importance because samples of identical c t but 
different N will vary in their signal intensity. 

DISCUSSION 

The relative size of the targets and probes is an important characteristic of oligonucleotides microarrays. When 
the two are of equal size, N = n, the onset of interaction between the probes is roughly set by the span of the probes 
as determined by n. In biology experiments the targets are much larger, N 3> n, and the onset of interactions is 
controlled by N. The progress of hybridization can give rise to crowding of the non- hybridized tails when R 2 F > S . 
The polyelectrolyte brush thus formed affects the hybridization isotherm and the rate equations. In particular, it 
lowers both the hybridization rate and the attainable hybridization for a given concentration of targets. It is important 
to allow for this effect in the design of DNA microarrays, in the formulating of the protocols of sample preparation 
and hybridization as well as in the analysis of the results. With regard to design of DNA chips the brush effect is 
important in choosing the desired density of oligonucleotide probes, or equivalently So. The brush effect will lower the 
fraction of probes that actually hybridize. As a result, the benefits of increasing the surface density of oligonucleotide 
probes diminish when the intended targets are long. When S is set, these considerations suggest a criteria for tuning 
the length of the targets, N, as controlled by the choice of the PCR primers or of the fragmentation procedure. 
In particular, it is beneficial to shorten N so as to avoid crowding. When brush effects do occur the analysis of 
the results should allow for the ensuing deviations from the Langmuir behavior. This is an important point for the 
implementation of model based algorithms. 

Physical chemistry type experiments, that aim to investigate the function of DNA microarrays, tend to focus on 
the symmetric case, of A" = n. Our discussion highlights the merit of studying the kinetics and the equilibrium 
behavior in the asymmetric case, N 3> n. In this case it is of interest to correlate the hybridization behavior with 
measurements of the brush thickness. 

Our analysis focused on the case of ssDNA targets so as to avoid complications due to the secondary structure of 
RNA molecules. The importance of the secondary structure of RNA targets, as used in gene expression experiments, 
is yet to be established because the effect of labelling by biotin is not well understood. The effect of the fragmentation 
on the kinetics of hybridization suggests however that a crowding effect of some sort is indeed involved. 
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APPENDIX A: THE VIRIAL COEFFICIENT AND THE CASE OF A BRUSH OF RODS 



Consider first the second virial coefficient 

1 

V= 2 



[1 - exp(-£/(r) /kT)} 4irr 2 dr (43) 



for spherical monomers of radius a when their interactions are purely repulsive. In particular, the interaction potential, 
U(r), comprises a hard core repulsion together with a screened electrostatic repulsion, that is 

{oo for r < a, 

kT ^M-(r-a)/m] {oTr>a (44) 
r l + a/r D 

Here ro is the Debye screening length and Ib is the Bjerrum length (Fowler and Gugenheim, 1960). The hard core 
contribution to v is 27ra 3 /3. The electrostatic contribution, assuming that U/kT -C 1, is 2-kIbTu and altogether 

v= ^a 3 + 2irl B r 2 D . (45) 

If one supplements the electrostatic repulsion U by a weak van der Waals attraction the first term assumes the 
form 27ra 3 (l — 9/T)/3 where 8 is the theta temperature (Rubinstein and Colby, 2003) thus leading to Eq^| For 
0.1M of NaCl salt, m = 10 A and assuming a — 6A we find that the electrostatic term dominates. When the salt 
concentration is IM the screening length diminishes to ro = 3A and the two terms are comparable. 

In the case of probes and targets of equal length, n = N, the probe layer consists of a mixture of single stranded 
probes and hybridized, double stranded ones. The associated interaction free energy for this case can be obtained 
(Halperin et al., 2004a) upon assuming, following Korolev et al. (1998), that both adopt rod-like configurations of 
equal length L = nb where b ~ 3.4^4 is the contribution of a base (base pair) to the length of the rod. The hybridized 
probes are rod-like because a dsDNA is rigid on the length scales of a typical probe (10 < n < 30). Viewing the 
unhybridized probes as rigid rods is an approximation justified, for the short probes, by two related observations. One 
is the tendency of ssDNA to form rigid domains of single stranded helices due to stacking interactions (Cantor and 
Schimmel, 1980; Turner, 2000; Buhot and Halperin, 2004). The second is that the persistence length attributed to 
ssDNA is comparable to the length of the probes. It is important however to stress that the configurations of ssDNA 
are not yet fully characterized. As noted in Design of Oligonucleotide Microarray Experiments, the reported values of 
the persistence length of ssDNA vary over a wide range 7.5 A < l p < 35 A. Similarly, the thermodynamic parameters of 
the stacking interactions are not fully established. With these reservations in mind, this picture provides a convenient 
approximation because it allows us to assign to the probe layer a unique thickness, independent of x. In particular, 
the thickness of the probe layer is comparable to L. The interaction free energy density within the probe layer is 
accordingly, Fi nt = vc 2 where c = n(l + x)/Y>qL is the number concentration of monomers within the layer and the 
interaction free energy density per unit area is 



, , n 2 (l + x) 2 

j e l = 2nl B r D { ' . (46) 



Accordingly, the overall free energy per probe site 



Isite = 7o + xn° pt + (1 - x)fi° + S 7e;(^) + RT[x\nx + (1 - x) ln(l - x)} (47) 



"pt T ^ ^JH-p 

and the equilibrium condition /i t = fi p ^ = dj S it e /dx yields 

x, 



1 X eq 



c t ^ pt exp[-r(l + x e(? )] (48) 



with r = Ami 2 lBr 2 D /£q£, as obtained earlier using the box approximation for the solution of the Poisson-Boltzmann 
equation (Halperin et al., 2004a) with a different prefactor. The isotherm obtained above differs from the "brush 
isotherm" because the chain elasticity does not play a role and the layer thickness does not exhibit an x dependence. 



APPENDIX B: EFFECT OF CHAIN SELF- AVOIDANCE ON THE HYBRIDIZATION CONSTANTS 



The Flory approximation as used in the text overestimates both the elastic and interaction free energies. Another 
delicate point concerns the entropy of the free ends. At the same time, the Flory approximation is known to be robust 
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and its performance for the brush has been studied showing relatively mild deviation from the exact results obtained 
by SCF theory (Milner, 1991). With these points in mind it is of interest to confirm the results obtained utilizing the 
Alexander-Flory approximation by a more rigorous approach. In the following we present exact results concerning 
K . In particular, the alternative derivation allows for the chain self-avoidance while ignoring the small correction 
due to interactions between the hybridized ds domain and unhybridized tail of the target. To this end we utilize the 
partition function of a self-avoiding chain (Duplantier, 1989; Eisenrigler et al., 1982). The partition function Z co u(N) 
of a free self-avoiding chain of N monomers is 

Z C ou(N) = z N N't- 1 (49) 

where z is model-dependent effective partition function of a monomer, and 7 is a universal configurational exponent. 
For a self-avoiding chain with a terminal monomer anchored to an impenetrable planar surface, a "mushroom" , the 
partition function is 

Z m ush{N) = Z N N^- 1 (50) 

where 71 is a different universal configurational exponent. 

When a probe and a target hybridize, the ds domain can be envisioned as a rigid rod with a partition function 

Zrodin) = z? od = z 2 n exp[-AG%(n)/RT} (51) 

Here, z ro d is the partition function of a pair of hybridized monomers, zq is partition function of a single monomer in 
an ideal Gaussian coil, n is number of pairs in the ds domain, and AGp t (n) is the free energy difference between the 
rigid ds and ideal coil ss domains. The free energy G is related to partition function Z by G = —RT\n(Z). 

The hybridization constant K pt in a solution of targets and probes whose respective lengths are N and n <C N is 

K pt = exp{-[G ro d(n) + G coll {N - n) - G coil {N) - G coU (n)]/RT} (52) 

Using EqEni Eq[5l]and EalCT we obtain 

Z ro d(n)Z co u(N - n) _ (N-nX 1 ^ 1 



Z coi i (n)Z coi i (N) \ N 1 - ' 



2/i 



where K pt = exp[— AG pt /RT] as introduced earlier. 
For the hybridization at a surface in the 1 : 1 regime 

= exp{-[G ro d(n) + G mush (N - n) - G coil (N) - G mush {n) - 5S rod ]/RT} (54) 

Here, 5S ro d = m (/?) is the reduction in the rod entropy due to its attachment to the surface. The specific value of j3 
of the order of unity depends on the length and flexibility of the spacer. In the simplest case of a short flexible spacer, 
the surface eliminates half of space available to a free rod in the solution, thus yielding (3 = 1/2. Eg 1491 EalSUI and 
EqGHlead to 

a Z r od{n)Z mush {N - n) (N - n \ 71_1 , fz \ 2n , UPTl fKK , 

K P t = P— 7 — rr^ — rw^^Pi N \~) exp [-^GoW/rt] (55) 

1 Z m ush (n)Z con (Jy) \ n J \zJ 

-^(^"(fr 

The ratio of hybridization constants at the surface and in solution, as determined from Eg 1531 and Eg 1551 is 

K v't n fn\ 7-7i 



P(-) iV»n (56) 



K pt r \N 

The values of 7 1.167 and 71 rj 0.695 were obtained using field theoretical methods and numerical calculations 
(Duplantier, 1989; Eisenrigler et al., 1982). Therefore, 7 — 71 w 0.47 is in close agreement with Kh 1 /K pt = (n/N) 2 / 5 , 
Kqllll 
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